Efficient Decision-Theoretic Target Localization
Abstract
Partially observable Markov decision processes (POMDPs) offer a principled approach to control under uncertainty. However, POMDP solvers generally require rewards to depend only on the state and action. This limitation is unsuitable for information-gathering problems, where rewards are more naturally expressed as functions of belief. In this work, we consider target localization, an information-gathering task where an agent takes actions leading to informative observations and a concentrated belief over possible target locations. By leveraging recent theoretical and algorithmic advances, we investigate offline and online solvers that incorporate belief-dependent rewards. We extend SARSOP, a state-of-the-art offline solver, to handle belief-dependent rewards, exploring different reward strategies and showing how they can be compactly represented. We present an improved lower bound that greatly speeds convergence. POMDP-lite, an online solver, is also evaluated in the context of information-gathering tasks. These solvers are applied to control a hexcopter UAV searching for a radio frequency source, a challenging real-world problem.
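To make the idea of a belief-dependent reward concrete, here is a minimal sketch. It is not the paper's formulation: it assumes a discrete belief over a handful of candidate target cells and uses negative entropy as the reward, which is one common way to score how concentrated a belief is. The function names and the example likelihoods are hypothetical.

```python
import math

def bayes_update(belief, likelihoods):
    """Return the posterior over target cells.

    belief      -- prior probability of the target being in each cell
    likelihoods -- probability of the received observation given each cell
                   (hypothetical sensor model, supplied by the caller)
    """
    unnormalized = [b * l for b, l in zip(belief, likelihoods)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the belief")
    return [u / total for u in unnormalized]

def negative_entropy_reward(belief):
    """Belief-dependent reward: larger (closer to 0) for concentrated beliefs."""
    return sum(p * math.log(p) for p in belief if p > 0.0)

# A uniform prior over four cells, then an observation that favors cell 0.
uniform = [0.25, 0.25, 0.25, 0.25]
likelihoods = [0.7, 0.1, 0.1, 0.1]  # assumed values for illustration
posterior = bayes_update(uniform, likelihoods)

reward_before = negative_entropy_reward(uniform)
reward_after = negative_entropy_reward(posterior)
```

In this sketch the reward depends only on the belief, not on the hidden state, so an informative observation raises the reward even though the agent never learns the true target cell.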
Similar resources
A Value Efficiency-Based Target Setting Approach in Data Envelopment Analysis
Basic models of Data Envelopment Analysis are intrinsically preference-free, in the sense that they treat all inputs and outputs, and all decision-making units, as equally important. Although this property is beneficial in many ways, it also has drawbacks, as the decision makers’ preferences are not taken into account in the process of evaluating units. To overcome this ...
Three Dimensional Localization of an Unknown Target Using Two Heterogeneous Sensors
Heterogeneous wireless sensor networks consist of different types of sensor nodes deployed in a particular area. Different sensor types can measure different quantities of a source, and combining different measurement techniques reduces the minimum number of sensors needed in localization problems. In this paper, we focus on the single source localization in a heterogene...
Game-Theoretic Approach for Pricing Decisions in Dual-Channel Supply Chain
In the current study, a dual-channel supply chain is considered containing one manufacturer and two retailers. It is assumed that the manufacturer and retailers have the same decision powers. A game-theoretic approach is developed to analyze pricing decisions under the centralized and decentralized scenarios. First, the Nash model is established to obtain the equilibrium decisions in the decent...
Decision-theoretic approach to maximizing fairness in multi-target observation in multi-camera surveillance
Central to the problem of active multi-camera surveillance is the fundamental issue of fairness in the observation of multiple targets such that no target is left unobserved by the cameras for a long time. To address this important issue, we propose a novel principled decision-theoretic approach to control and coordinate multiple active cameras to achieve fairness in the observation of multiple...
Efficiency of Target Location Scenarios in the Multi-Transmitter Multi-Receiver Passive Radar
Multi-transmitter multi-receiver passive radar, which locates a target in the surveillance area using signals from available opportunistic transmitters reflected off the target, is of interest in many applications. In this paper, we investigate different signal processing scenarios in multi-transmitter multi-receiver passive radar. These scenarios include decentralized processing of reference ...
Publication date: 2017